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TITLE 

TARGETED GENE REPLACEMENTS IN 
ENTERIC BACTERIA USING LINEAR DNA 
FIELD OF THF. TNTVENTION 
The present invention relates to the field of molecular biology and 
microbiology. More specificaUy, methods have been developed for the 
introduction of site-directed alterations in the Escherichia coli genome using 
linear DNA molecules to generate stable transformants. 

BACKGROUND OF THE INVENTTDN 
With the advent of large scale genome sequencing efforts, enormous 
amounts of sequence information are being made available to the research 
conmiunity. Genome sequencing efforts have been completed for several 
prokaryotes including Escherichia coli. At least 46 genomes of different genera 
of bacteria of industrial, pharmaceutical and agricultural importance, have been 
1 5 completely sequenced and about 70 are in progress 

(http://www.ncbi.nhn.mh.gov/PMGifs/Genomes/bact.htmI). However, only a 
small portion of the genes sequenced firom aU these efforts have been functionally 
characterized. 

The need for discovery and analyses of gene function has spawned several 
new areas of research referred to as functional genomics, proteomics. and 
metaboloraics. Functional genomics seeks to discover gene function once 
nucleotide sequence information is available. Proteomics (the study of protein 
properties such as expression, post-translational modifications, interactions, etc.) 
and metabolomics (analysis of metabolite pools) are fast-emerging fields 
25 complementing functional genomics, that provide a global, integrated view of 
cellular processes. The variety of techniques and methods used m this effort 
mclude the use of bioinformatics, gene-array chips, mRNA differential display, 
disease models, protein discovery and expression, and target validation. The 
ultimate goal of many of these efforts has been to develop high-throughput 
30 screens for genes of unknown function. 

A variety of techniques have been applied to elucidate gene function 
inoludine identifvina th#» inter5<ori««T 

yeast 2-hybrid system (Bolger, G., Methods Mol Biol. 88:101-131 (1998)) and 
transposon tagging which is useful in both microbial and eukaryotic genomes 
(Kumar et al.. Plant Biotechnol. 15:159-165 (1998). The logarithmic increase in 
sequence data has driven the need for high-throughput (HTP) functional genomics 
screens. However, relatively few HTP methods have been developed to date. 
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Traditional methods for the determination of gene function still remain the basis 
of the functional genomics effort. 

Classically, the first and most basic analysis for any gene is to assess the 
phenotype of the organism when the gene of interest is altered or rendered 
5 nonfunctional. The impact of such alterations as well as knockouts on proteome 
or mctabolome can be measured to understand gene function. The "nxxll 
mutations" are often constructed by gene dismption (also called gene knockouts) 
by homologous recombination resulting in allelic exchange or gene replacement. 
Gene disruption vectors are constmcted by recombinant DNA techniques. Upon 

10 transformation into tiie organism the DNA construct with disrupted gene 

integrates at the resident location in the genome by homologous recombination 
and replaces the functional copy of the gene with the nonfunctional dismpted 
gene. Gene disruption vectors are constructed from a genomic clone containing 
the gene of mtercst, using a vector that does not replicate in the host organism to 

15 be mutated. The final dismption construct has 1) a selectable marker inserted m 
and/or around the gene of interest in a manner that eliminates expression of the 
gene of interest or renders non-functional any protein product that is produced, 
and 2) additional unmodified genomic DNA flanking the gene disruption site of 
mterest that helps target integration events to die disruption site. Gene dismption 

20 vectors have been constmcted for microbial and mammalian systems by 

(i) identifying and isolating a genomic clone containing the gene of interest; 

(ii) determining convenient restriction sites within or flanking the gene; 

(iii) subcloning a selectable marker into the gene at the restriction sites thereby 
mactivating the gene; (iv) transforming the altered gene containing die marker mto 

25 the host organism; and (v) screemng the transformant for an altered phenotype 
(see for example Sedivy et al., Froc Natl Acad. Sci, USA 86:227-3 1 (1 989)). 

The above process has worked well for dismption of genes in a range of 
organisms but has several mherent limitations. Furst, a genomic clone of suitable 
length must be isolated firom a suitable library, most often by hybridization or 

30 cloned by PGR mto a suitable plasmid. Second, the methodology is completely 
limited to existing restriction sites which must be compatible with the selectable 

marker to be liajifpH nnri mnct or'Mif 1T1 n/^kci^^t^e* cttmy^U i: *: 

selectable marker makes die gene of interest nonfunctional. Additional restriction 
sites that occur outside the gene of interest can make cloning of the selectable 
35 marker into the genomic DNA very difficult. Thhd, because this process relies on 
traditional restriction analysis and cloning, it is inheientiy a labor intensive, one- 
gene-at-a-time approach. Fourth, cosuppression or quelling (Cogoni et al., EMBO 
J. 15:3153-3163 (1996)), a gene silencing phenomenon seen in fungi and plants 
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can result whea a gene disruption vector leaves the promoter and part of the 5* 
coding region intact. Fifth, these methods often give unpredictable results. For 
example, a desired gene targeting vsdll result in replacement of the functional 
allele with a null mutation results. If, however, non-targeted, ectopic integration 
5 occurs (and this undesired integration often predominates) then the transformant 
harbors both a fimctional and nonfimctional copy of the gene. Yeast and 
Escherichia coli have previously been used for in vivo cloning for double strand 
break repair or gap repair (Oldenber et al.. Nucleic Acids Research 25:45 M52 
(1997); Hua et al., Plasmid 3^:91-96 (1997); Oliner et al., Nucleic Acids Research 

10 21:5192-5197 (1993); Bubecket al.. Nucleic Acids Research 21:3601-3602 

(1 993)). In these repair mechanisms, a restriction enzyme-linearized plasmid is 
cotransformed with a piece of DNA with homology to either side of the gap. This 
method of constmction requires, and is limited by, knowledge of restriction 
enzyme sites in the genomic DNA. Yeast and Escherichia coli can efficiently use 

1 5 this homology to repair the gap and produce a functional circular plasmid. 

Although these methods are effective, they are time consuming, often requiring 
several months of work, and require a detailed endonuclease restriction prior 
to the analysis. Further, these approaches are not easily adapted to the high- 
throughput methodology needed today. 

20 An alternative to the restriction analysis and subcloning methods are 

available where the host organism is capable of efBcient homologous 
recombination (e.g., yeast). While most organisms require a m inimnm of several 
kilobase of homologous sequence to target integration, Saccharomyces cerevisiae 
requires more limited amounts of homology to target integration to the 

25 chromosome. Thus, a selectable marker can be amplified by PGR with targeting 
homology (60 bp on each side) incorporated in the PGR primers, thus avoiding the 
need for restriction analysis and subcloning with a large isolated genomic clone 
(Manivasakam et al.. Nucleic Acids Research 23:2799-2800 (1 995)). This method 
is the basis of an effort by the yeast researchers worldwide to disrupt all 6000+ 

30 genes in this organism. This method is practiced with normal yeast chromosomes 
as well as with artificial chromosomes. Treco et al. (U.S. 5,783,385) teach the 

lH#*nti'fir'a+ir\T* arif^ lOrtl«l+iri-ri rt^+^-r**-!*** PlKTA -A^mvittn-frt -Ai^M vVoAn^ 1«1 
*wv^j.t>»Mvru vr*. MM^wit J-rx i ^ ». .M, C*^i J.* wai.i^j ±x\JX1lX Vl^CUL CU. ilXxWlCU- 

chromosome (YAC) yeast libraries, relying on the homologous recombination of 
transformed non-replicating plasmid DNA with the yeast genome. 
35 Yeast provides a particularly efficient envirormaent for homologous 

recombination events between genetic elements especially using linear DNA, 
although similar methods have been developed in bacterial organisms. For 
example, Zhang et al., {Nat Genet 20:123-128 (1998)) teach the site-specific 
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manipulation of target DNA by homologous recombination between linearized 
and chromosomal DNA (as well as plasmids and large episomes) in 
Escherichia coli sbcA mutants expressing RecE and RecT. These methods are 
usefiil, but require use of special mutant strains of E, coli that reduce the level of 
5 DNA damaging endonucleases which may digest foreign, linearized constructs. 

Murphy, K. C, (J BacterioL 180:2063-2071(1998)) have reported the use 
of A, recombination system (Red) for generatmg gene knockouts in Escherichia 
coli using linearized DNA. However, use of this system for mutagenesis suffers 
from requirement of extra steps for transfer of X Red functions to the host strain 
10 that needs to be mutated. 

Several reports including the above mentioned have demonstrated that in 
the absence of specialized genetic backgrounds, integration of disrupted genes by 
allelic exchange does not occur in Escherichia coli genome. 

El Karoui et al. {Nucleic Acids Res, 27: 1296-1299 (1999), and 8*** Int'L 
1 5 Symp. on the Genetics of Industrial Microorganisms, 28 June 1 998) report that 
gene replacements using linear double-stranded DNA can be achieved in wild- 
type Escherichia coli if electrocompetent cells are used. 

The problem to be solved is to provide a convenient method without the 
limitations of known techniques to create site-directed mutants of Escherichia coli 
20 (or other bacteria) by allelic exchange using linear fragments of DNA to identify 
an altered phenotype that will define the function or effect of the mutated gene. 
Tmditionally, the production of Escherichia coli mutants has been accomplished 
utilizing suicide vectors, which requires cloning of the gene disrupted with a 
selectable marker in such a vector. Winans et al. (7. BacterioL 161 : 1219-1221 
25 ( 1 985)) have shown that site-directed mutants of Escherichia coli strains can be 
obtained using linearized fragments of DNA containing a selectable marker 
flanked by homologous DNA. However, these examples assert the requirement of 
specific genetic backgrounds (e.g., recB, recC and/or sbcB). The recB and recC 
mutations inactivate exonuclease, preventing it from degrading the linear DNA, 
30 whereas the sbcB mutation restores recombination proficiency in a strain which 
carries recB and recC mutations. 

Although the above traditional methods permit generation of reliable site- 
directed mutants, there are clear disadvantages to both methods. The first one 
requires clonmg mto suicide vectors, which are available for only very limited 
3 5 number of bacteria. Additionally it requires an extra cloning step and is time 

consxuning. Another disadvantage of suicide vectors in gene replacement efforts is 
that considerable background results due to single recombination events. The 
second method of Winans et al. suffers from having to use special genetic 
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backgrounds which^ for several reasons may not be desirable when the organism 
to be mutated is £. colU or not available at all (i.e., most bacteria other than 
Escherichia coli). 

SUMMARY OF THE INVENTIQN 
5 This invention provides a method for targeted DNA replacement in 

bacteria comprising introducing into a recombination-proficient bacteria a 
replacement cassette of DNA comprising a 5' recombinant region and a 3* 
recombinant region, each region independently at least 75% identical to a DNA 
target region of the bacteria. The replacement cassette may or may not 
10 additionally include a functional gene positioned between the two recombinant 
regions that upon integration into host restores gene functionality. The basis for 
screening is the phenotypic efifect of a selectable marker that results from 
integration of the replacement cassette in the target sequence. Useful application 
of this method is to carry out site-directed DNA replacements for generation of 
1 5 desired DNA modifications in bacteria. 

BRIEF DESCRIPTION OF THE DRAWINGS AND 
SEQUENCE DESCRIPTIONS 
The invention can be more fully understood &om the following detailed 
description and the accompanying Figures and Sequence Listing which form a 
20 part of this application. 

Figure 1 shows a schematic diagram of E. coli genomic DNA region that 
harbors yc/X gene, the location of yciK ORF with respect to restriction sites 
relevant to gene constructions in this invention. Location of primers xised for 
amplifying ;;czA: DNA for pyciKCK, pSCyciKl and for knockout confirmations, is 
25 shown along v^th their Seq IDs. 

Figure 2 shows the plasmid construct pSCyciKl . yciK DNA flanked with 
engineered Hin^m zxAXhol sites was cloned into vector pBluescript II SK (+). 

Figure 3 shows a schematic diagram of HinCll fragment of pLoxCatl with 
relevant restriction sites used in engineering pyciKcat2 and pyciKcatS and for 
30 confirmations of chloramphenicol resistance marker (SEQ ID NO:7) in mutants. 
Location and direction of cat ORF is shown as an arrow. 

1 x^ui.^ -r ajuv/YVd uiw piooxiuu wuiouuwi ^^yv^uvcixtx. luc J7mv^u ixagmeni 
from yciK region (See Figure 1 ) was replaced with the 1 .2 kb HinCU-Smal 
fragment of pLoxCatl . cat gene is in opposite orientation to yciK gene. 
35 Figure 5 shows the plasmid construct pyciKcat3 engineered same as 

above. However, here cat gene is in same direction as yciK gene. 

Figure 6 is a gel confirming yciAT knockouts by genome-specific PGR. 
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Figure 7 is a gel confinning yciK knockouts by Southern hybridization of 
SstU digested genomic DNA from wild-type and mutual E. co// probed with SEQ 
ID NO: 18. 

Figure 8 shows a schematic diagram to describe Overiap-Extension PGR 
5 to generate tpiA deletion. 

Figure 9 shows the plasmid construct pRNl 06-2. A 2004 bp overlap 
extension PGR product with 73% of the ipU structuml gene deleted (Figure 8) 
was cloned into pGR-Blunt (Invitrogen) to yield the 5517 bp plasmid pRN106-2. 
The two extreme ends of tpiA are shown with an engineered Seal site in between. 
1 0 Figure 1 0 shows the plasmid construct pRN 107-1, the source of the 

linearized DNA fragment used to generate the tpU knockout The L2 kb HinCIl 
fragment from pLoxGatl containing the loxP-CmR-loxR cassette was inserted 
into the Seal site of pRN106-2 to yield the 6763 bp plasmid pRN107-L Using 
pRN107-l as template and primers SEQ ID NO: 1 and SEQ ID NO:4, the 3246 bp 
1 5 fragment containing tpiA flanking regions and the loxP-CmR-loxP cassette was 
PGR-amplified and this linear DNA fragment was used to transform KLP23 to 
generate ipiA mutants by double crossover. 

Figure 1 1 shows the design for confirmation of tpiA knockouts by 
genome-specific PGR. 
20 Applicant(s) have provided 1 9 sequences in confomiity with 

37 C,F.R. L821-L825 CRequirements for Patent Applications Containing 
Nucleotide Sequences and/or Amino Acid Sequence Disclosures - the Sequence 
Rules") and consistent with World Intellectual Property Organization (WIPO) 
Standard ST2.5 (1998) and the sequence listing requirements of the EPO and PCT 
25 (Rules 5.2 and 49,5(a-bis), and Section 208 and Amicx C of the Administration 
Instructions). 

SEQ ID NO:l is the nucleotide sequence of DNA fragment containing 
yciK gene. 

SEQ ID N0:2 is the forward primer for amplification of the yciK gene 
30 from the Escherichia coli genome, 

SEQ ID NO:3 is the reverse primer for amplification of the jczJi: gene from 

SEQ ID NO:4 forward primer with ^zwdlll site. 
SEQ ID NO:5 reverse primer with ATioI site. 

SEQ ID NO;6 is the nucleotide sequence of the yciK fragment flanked 
with/findra BnAXhol sites. 

SEQ ED NO:7 is the HinCll fi^gment of pLoxCatl. 
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SEQ ID NO: 8 is the deduced amino acid sequence encoded hy yczK open 
reading frame. 

SEQ ID NO:9 is the forward primer used in a PGR reaction to confirm 
yczK knockout. 

5 SEQ ID NO: 1 0 is the reverse primer used in a PGR reaction to confirm 

j^c/AT knockout, 

SEQ ID NO: 1 1 is a primer between sbp and cdh, 
SEQ ID NO:I2 is a primer within tpiA 3' region. 

SEQ ID NO: 13 is a primer within tpiA 5* region with an introduced Seal 

10 site, 

SEQ ID NO: 14 is a primer within >^/7/C 5' region* 
SEQ ID NO:15 is a primer at 5' end of sbp, 
SEQ ID NO:16 is a primer at 3' end of yiiR. 

SEQ ID NO:17 is expected PGR fragment when SEQ ID N0:9 and SEQ 
1 5 ID NO: 1 0 were used as primers with Escherichia coli genomic DNA as template. 
SEQ ID NO:18 is a 969bp Sstll fragment of pyciKCK. 
SEQ ID NO: 19 is the sequence of expected band in Southern experiment 
when SstU digested wild-type genomic DNA is probed with 969bp Sstll Augment 
(SEQ ID NO:18) of pyciKCK. 
20 DETAILED DESGRIPTIQN OF THE INVKNTTON 

The present invention provides a rapid and specific method for the 
modification or disruption of genes or other nucleic acid fragments. The thus- 
modified or disrupted genes may then be transfonned into their native hosts and 
screened for an altered phenotype, indicating gene function. The invention creates 
25 stable site-directed alterations in E. coli or other gram negative enteric bacterial 
genome by allelic exchange using Imear fragments of DNA to identify an altered 
phenotype that will define the function or effect of the mutated gene. The 
invention does not have the limitadons of the prior art with regard to time and 
restriction to specific genetic backgrounds. 
^0 Additionally the present method has application in the areas of fimctional 

genomics, proteomics, metabolomics where gene disruptions to reveal function of 

unicTinwn trenpQ ( ar> H t)i/*ir •r«*rti-1iir»*c!'\ imUM^tirM ^■^•^^^-^^^ ^.f i». j 

their products) or other fimctional DNA elements is desirable. Similarly the 
present method may be used for metabolic engineering in techniques such as gene 
35 silencing by disruption, gene integration or replacement to add genes of new 

function or to replace a defective gene with a functional gene. In similar fashion 
the present method may be useful to regulate or modify genes or operons by 

7 
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altering or modifying regulatory sequences such as promoters, ribosomal binding 
sites, terminators, and enhancers, for example. 

In this disclosTire, a number of terms and abbreviations are tised. The 
following definitions are provided. 
5 "Polymerase chain reaction" is abbreviated PGR, 

"Open reading frame" is abbreviated ORF. 
"Isopropyl p-D"Thiogalactoside'* is abbreviated IPTG. 
"Sodium dodecyi sulfate*' is abbreviated SDS. 
"Luria-Bertani" is abbreviated LB. 
10 "Gene" refers to a nucleic acid firagment that expresses a specific RNA 

only or a specific RNA and protein, including regulatory sequences preceding 
(5' noncoding sequences) and following (3' noncoding sequences) the coding 
sequence. The terms "synthetic genes" and "synthetic DNA" refer to genes and 
DNA that can be assembled from oligonucleotide bxiilding blocks that are 
1 5 chemically synthesized using procedures known to those skilled in the art. These 
building blocks are ligated and annealed to form gene constructs. "Chemically 
synthesized", as related to a sequence of DNA, means that the component 
nucleotides were assembled in vitro. Manual chemical synthesis of DNA may be 
accomplished using well established procedures, or automated chemical synthesis 
20 can be performed using one of a number of commercially available machines, 
"Gene constmcts" may contain a full gene or less or more than a full gene. 

The term "gene dismption" is used mterchangeably with the term "gene 
knock out" to refer to the process of interfering within a gene such that no 
functional gene product is expressed. 
25 The term "gene replacement" refers to a process which replaces a gene 

with either a functional or a mutated gene such that either no gene product is 
expressed or a mutant gene product is expressed. 

The term *T)NA replacement" means a process in which one fragment of 
DNA is replaced wdth another DNA fragment. 
30 The term "gene modification" means any process where a gene is altered 

in any way including gene disruption or gene replacement. 
The term "cene tar^etincr" refers tn 

w o — ' — ^-.-w^-^^irfw TTMw&v M. uj^vuxAW ^fi\\^ VYlLUlll. ix 

gene or nucleic acid fragment is identified or targeted on the basis of sequence. 

The term "nucleic acid fragment" refers to a polymer of RNA or DNA that 
35 is smgle- or double-stranded, optionally containing synthetic, non-natural or 
altered nucleotide bases. A nucleic acid fragment in the form of a polymer of 
DNA may be comprised of one or more segments of cDNA, genomic DNA, or 
synthetic DNA. A nucleic acid firagment may be a portion of a gene, may be 

8 
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synthetic, or may be a genetic regulatory element. A "target nucleic acid 
fragment'' or **target gene" or "target DNA'* is any nucleic acid fragment that is 
inserted into a modification plasmid that is targeted for modification or disruption. 
The terms "plasmid", 'Vector", and "cassette'' refer to an extra 
5 chromosomal element often carrying genes which are not part of the central 
metabolism of the cell, and usually hi the form of circular double-stranded DNA 
molecxilcs. Such elements may be autonomously replicating sequences, genome 
integrating sequences, phage or nucleotide sequences, linear or circular, of a 
single- or double-stranded DNA or RNA, derived from any source, in which a 

1 0 number of nucleotide sequences have been joined or recombined into a unique 
construction which is capable of introducing a promoter fragment and DNA 
sequence for a selected gene product along with appropriate 3* untranslated 
sequence into a celL "Transformation cassette" refers to a specific vector 
containing a foreign gene and having elements in addition to the foreign gene that 

1 5 facihtate transformation of a particular host cell. "Expression cassette" refers to a 
specific vector containing a foreign gene and having elements in addition to the 
foreign gene that allow for expression of that gene in a foreign host. 

The term "selectable marker** refers to those genes encoding proteins that 
may be expressed and/or which convey a phenotype on the host that enables 

20 selection, A defective or fimctional gene may itself act as a selectable marker if 
its presence leads to a phenotype that can be selected against the bacl^round. 
"Background" is undesirable colonics that appear during screening of gene 
replacement events. The term "modification plasmid" refers to a specialized 
plasmid for use in the present invention comprising, at a minimum, a plasmid- 

25 specific marker and a replacement cassette. 

"Enteric-specific selectable markers" are those genes encoding proteins 
that may be expressed in Enteric bacteria (e.g., Salmonella sp., Escherichia sp.), 
which convey a phenotype on the enteric host that enables selection. "Enteric 
bacteria" are members of the family Enter obacteriaceae^ and include such 

30 members as Escherichia^ Salmonella^ and Shigella. They are gram-negative 
straight rods, 0.3-1 .0 X 1 .0-6.0 |im, motile by peritrichous flagella, except for 
Tntufttellci^ or nonmotile. They grow in the presence and absence of oxygen and 
grow well on peptone, meat extract, and (usually) MacConkey's media. Some 
grow on D-glucose as the sole source of carbon, whereas others require vitamins 

35 and/or mineral(s). They are chemoorgano-trophtc with respiratory and 

fermentative metabolism but arc not halophilic. Acid and often visible gas is 
produced dming fermentation of D-glucose, other carbohydrates, and 
polyhydroxyl alcohols. They are oxidase negative and, with the exception of 
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Shigella dysenteriae 0 group 1 and Xencrhabdus nematophilus, catalase positive. 
Nitrate is reduced to nitrite except by some strains of Erwinia and Yersina. The 
G + C content of DNA is 38-60 mol % (1^, Bd). DNAs from species from 
species within most genera axe at least 20% related to one another and to 
5 Escherichia coli, the type species of the family. Notable exceptions are species of 
Yersina^ Proteus, Providenicay Hafitia and Edwardsiella, whose DNAs are 
10-20% related to those of species from other genera. Except for Erwinia 
chrysanthemi all species tested contain the enterobacterial common antigen 
(Bergy's Manual of Systematic Bacteriology, D. H. Bergy, et al., Baltimore: 
1 0 Williams and Wilkins, 1 984). 

The temis "KO vector" or "knock out vector" refer to a modification 
plasmid lacking the target gene to be modified. 

The term "knock out plasmid" refers to a modification plasmid that 
contains a gene targeted for disruption. 
15 The term "knock out cassette" refers to a modification cassette that 

contains recombination regions designed to insert a modifying nucleic acid within 
the coding region of the target gene so as to prevent the effective expression of 
that target gene. 

A "replacement cassette" refers to a specialized DNA cassette that 
20 comprises at a minimum, a selectable marker, a modifying DNA and 5' and 3' 

recombination regions. Replacement cassettes may optionally also comprise other 
modifying DNA or RNA sequences, mserted between the flanking recombination 
regions. Within the context of the present invention, replacement cassettes 
containing modifying DNA mteract with the DNA to be replaced via the 
25 mechanism of homologous recombination to permit modification or dismption of 
die target gene or target nucleic acid. 

The teim "genomic host" refers to the cell or host from which the target 
gene or DNA has been cloned. 

The term "inserting nucleic acid fiagment" refers to a DNA ("inserting 
30 DNA Augment") or RNA ("mserting RNA firagment") molecule residing in a 
replacement cassette that is useful for the modification or dismption of a target 

a — ^ — — — — — - xAAw xAwwi.uu.^ jjiuwAWidrf oviu ixo^iiii^xit wixi 

insert at a site directed by the sequence of the recombination regions on the 
cassette. "Inserting DNA" may be eitiier ' Wdifying" or "dismpting". 
35 "Modifying DNA" or "modifying nucleic acid fiagments" will result in the 
altering of the composition or function of a target gene but will not dismpt the 
gene. "Dismpting DNA" or "disrupting nucleic acid fi:agments" will have the 
effect of disrapting the target gene of interest. "Modifymg DNA" or "disrupting 
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DNA" will include but will not be limited to non-specific DNA or RNA 
sequences, selectable markers, origins of replication, antisense sequences and 
regulatory elements. As used herein, "regulatory elements" refer to nucleotide 
sequences located upstream (5'), within, and/or downstream (3') to a coding 
5 sequence, which control the transcription and/or expression of the coding 

sequences, potentially in conjunction with the protein biosynthetic apparatus of 
the cell. In artificial DNA constructs regulatory elements can also control the 
transcription and stability of antisense RNA. One specific regulatory element is a 
"promoter". "Promoter" refers to a DNA sequence capable of controlling the 

10 expression of a coding sequence or functional RNA. In general, a coding 

sequence is located 3' to a promoter sequence. Promoters may be derived in their 
entirety from a native gene, or be composed of different elements derived from 
different promoters found in nature, or even comprise synthetic DNA segments. It 
is understood by those skilled in the art that different promoters may direct the 

15 expression of a gene in different tissues or cell types, or at different stages of 
development, or in response to different environmental conditions. Promoters 
which cause a gene to be expressed in most cell types at most times are conunonly 
referred to as "constitutive promoters". It is further recognized that since m most 
cases the exact boundaries of regulatory sequences have not been completely 

20 defined, DNA fragments of different lengths may have identical promoter activity. 

The term "recombination region" refers to 3' and 5' flanking nucleic acid 
regions on the replacement cassette. Recombination regions are designed to be 
either complementary to, or have significant base identity with, the corresponding 
regions of the target genes or target nucleic acid fragments to be dismpted or 

25 modified. Significant identity, for example, may range from about 70% between 
the bases of the target gene ^target region") and the recombination regions. 
Recombination regions at the 5' end of the cassette are refenred to as "5' 
recombination regions" and recombination regions at the 3* end of the cassette are 
referred to as "3' recombination regions". The maximum lengtii of the flanking 

30 recombination region is set at approximately 150 bases when using PGR 
amplification in the technique. 

nucleotide bases that are hybridizable to one another. For example, with respect 
to DNA, adenosine is complementary to thymine and cytosine is complementary 
35 to guanine. 

The term "percent identity" is a relationship between two or more 
polynucleotide sequences as determined by comparing the sequences. In the art, 
"identity" also means the degree of sequence relatedness between polynucleotide 

11 
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sequences as determined by the match between strings of such sequences. 
"Identity" and "similarity" can be readily calculated by known methods, including 
but not limited to those described in: Computational Molecular Biology (Lesk, A. 
M., ed.) Oxford University Press, New York (1988); Biocomnnting : Infnnnatir^ 
3 and q^ngmg Projects (Smith, D. W., ed.) Academic Press, New York ( 1 993); 
Computer Analysis of Seq uence Data. Part T (Griffin, A. M., and GrifBn, H. G., 
eds.) Humana Press, New Jersey (1994); Sequence Analysis In Mol^rnlar BinloffY 
(von Heinje, G,, ed.) Academic Press (1987); and Sequence Analysis Primer 
(Gribskov, M. and Devereux, J., eds.) Stockton Press, New York (1991). 
Prefened methods to determine identity are designed to give the largest match 
between the sequences tested. Methods to detennine identity and similarity are 
codified m publicly available computer programs. Prefcnred computer program 
methods to detennine identity and similarity between two sequences include, but 
are not limited to, the GCG Pileup program found in the GCG program package, 
1 5 as used in the instant invention, using the Needleman and Wunsch algorithm with 
their standard default values of gap creation penalt5r=12 and gap extension 
penahy=4 (Devereux et al.. Nucleic Acids Res. 12:387-395 (1 984)), BLASTP, 
BLASTN, and FASTA (Pearson et al., Proc. Natl. Acad. Sci. U.S.A. 
85:2444-2448 (1988). The BLAST X program is publicly available from NCBI 
20 and other sources (1BLAST Manual Altschul et al., Natl. Cent. Biotechnol. hif, 
Natl. Library Med. (NCBI NLM) NIH, Bethesda, MD 20894; Altschul et al., J.' 
Mol. Biol. 215:403-410 (1990)). Another preferred method to detennine pe^t 
identity, is by the method of DNASTAR protein alignment protocol using the 
Jotun-Hein algorithm (Hein etal.. Methods En2ymol. 183:626-645 (1990)). 
25 Default parameters for the Jotun-Hein method for alignments are: for multiple 
alignments, gap penally=l 1, gap length penalty=3; for pairwise alignments 
ktuple=6. By way of illustration, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 
die nucleotides in the reference sequence may be deleted or substituted with 
30 another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in 
the reference sequence may be inserted into the reference sequence. These 
mutations of the reference 

I — — - — — ^ ^^^^ fcuw ^ vrji »vAiiiiuni puamuilii %Jl 

the reference nucleotide sequence or anywhere between those terminal positions, 
interspersed either individuaUy among nucleotides in the reference sequence or in 
3 5 one or more contiguous groups within the reference sequence. 

The term "a partial sequence" or "a portion of a sequence" refers to a 
sequence of sufScient length to permit homologous recombination according to 
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the conditions of the present method. Typically, "a partial sequence" or "a portion 
of a sequence" will range from about 15 bp to about 200 bp. 

The term "coding sequence" refers to a DNA sequence that codes for a 
specific amino acid sequence. "Suitable regulatory sequences" refer to nucleotide 
5 sequences located upstream (5^ noncoding sequences), within, or downstream 
(3' noncoding sequences) of a coding sequence, and which influence the 
transcription, RNA processing or stability, or translation of the associated coding 
sequence. Regulatory sequences may include promoters, translation leader 
sequences, introns, and polyadenylation recognition sequences, 

10 The term "operably linked" refers to the association of nucleic acid 

sequences on a single nucleic acid fragment so that the fixnction of one is affected 
by the other. For example, a promoter is operably linked with a coding sequence 
when it is capable of affecting the expression of that coding sequence (i.e., that the 
coding sequence is under the transcriptional control of the promoter). Codmg 

1 5 sequences can be operably linked to regulatory sequences in sense or antisense 
orientation. 

The term "operon" refers to a group of operably linked genes and 
regulatory elements that functions as a unit of transcription. An operon starts with 
a promoter, which binds RNA polymerase and initiates transcription of the 
20 operon. 

The term "expression" refers to the transcription and stable accimiulation 
of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of the 
invention. Expression may also refer to translation of mRNA into a polypeptide. 

The term "recombination proficient" means that a microorganism is 
25 capable of integrating extm-chromosomal DNA mto its genome via homologous 
recombination. Additionally, a "recombination proficient" microorganism is also 
capable of recombining DNA resident on chromosome if sufficient homology 
exists between another DNA locus. 

"Transformation" refers to the transfer of a nucleic acid firagment into the 
30 host organism. "Stable transformation" refers to integrating the nucleic acid into 
the genome of the cell, 

"Absolute recombination frepuencies" rnnge from 2.3 X 1 0*^ to 1 .5 X 10"^ 
in E. coll depending on the system and length of homology involved. RecA is the 
most crucial component for the homologous recombination reaction, although 
35 recA independent mechanisms also exists. Homologs and analogs of rec^-like 
proteins and other genes linked to homologous recombination have been 
discovered in a wide range of bacteria and eukaryotes. (Lloyd, R. G. and Low, 
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10 



K. B, (1996) Homologous Recombination. In Escherichia coli and Salmonella 
Cellular and Molecular Biology, Ed. F.C. Neidhardt, ASM Press, p 2236-2255). 

For a gene replacement to occur which requires two homologous 
recombination event absolute recombination frequencies can be calculated to be in 
the range of 10-12 to 10-2. 

Linear DNA is the replacement cassette that is isolated from a 
modification plasmid by restriction digestion or by PGR amplification as 
described in Examples 1 and 4. Preparation of linear DNA is also described in 
Examples. 

Although linear DNA has been used for gene replacements in some 
bacteria such as Bacillus subtilis, literature has strongly indicated that gene 
replacements using linear DNA do not occur in wild type bacteria such as E, coli 
and Salmonella unless some mutations or special genes are introduced to render 
these organisms hyper-recombinant. Use of electroporation, offering the highest 
1 5 efBciencies of transformation among the presently known methods, demonstrates 
that linear DNA can be used for gene replacements in wild type £. coli. By 
achieving high eflSciencies of transformation, gene replacements can be performed 
in any recombination proficient bacteria which carry a functional recombination 
system. 

20 The present method is usefiil for the rapid disruption or modification of 

genes or other genetic elements. Genes of unknown fimction which are disrupted 

in this fashion are transformed into genomic hosts where screening for an eiitered 

phenotype will reveal their function. 

This method is also useful for replacing defective or mutated genes &om 
25 bacterial genome with the functional gene provided a selection phenotype exist for 

the function of the gene. 

The gene replacement techniques described in examples to follow can also 

be used for regulation of genes present in operons by replacement of operably 

linked promoter region with a promoter of desired strength, 
30 The method described in this invention to obtain gene replacement can be 

used in enteric bacteria without specialized host backgrounds orxise of suicide 

replacements with linear DNA in wild-type Escherichia coli on plasmid and 
chromosomal targets. However, any methods (chemical transformation or 
35 electrotransformation) that ensure a transformation efficiency high enough to 
enable the occurrence of double homologous recombination between the target 
DNA and the replacement cassette will work, in this specific case electro- 
transformation was used. The method of electrotransformation itself does not 
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guarantee a success in gene replacements if quality of competent cells is not good 
or if host strains deficient in recombination functions are used. A high efficiency 
of transformation must occur in recombination proficient bacteria. Depending on 
the absolute recombination frequencies of target gene (or target DNA) locus and 
5 the host strain, required transformation efficiencies will vary. Use of long PGR 
primers as described in Example 3 completely eliminates any cloning 
requirements for gene replacements. 

Therefore, the present method is useful for the rapid dismption or 
modification of genes or other genetic elements. Genes of unknown function 
1 0 disrupted in this fashion may be transformed into genomic hosts where screening 
for an altered phenotype will reveal their function. 

The present invention is further defined in the following Examples. It 
should be understood that these Examples, while indicating preferred 
embodiments of the mvention, are given by way of illustration only. From the 
1 5 above discussion and these Examples, one skilled in the art can ascertain the 
essential characteristics of this invention, and without departing from the spirit 
and scope thereof, can make various changes and modifications of the mvention to 
adapt it to varioxis usages and conditions. 

EXAMPLES 

20 GENERAL METHODS 

Procedures required for PGR amplification, DNA modifications by endo- 
and exonucleases for generating desired ends for cloning of DNA, ligations, and 
bacterial transformation are well known in the art. Standard recombinant DNA 
and molecular cloning techniques used here are well known in the art and are 

25 described by Sambrook, J., Fritsch, E. F. and Maniatis, T, Molecular Cloning: A 
Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor 
(1989) (hereinafter **Sambrook''); and by T. J. Silhavy, M. L. Bennau, and L. W. 
Enquist, Experiments with Gene Fusions. Cold Spring Harbor Laboratory Press, 
Cold Spring, N.Y. (1984) and by Ausubel, F. M. et al.. Current Protocols in 

30 Molecular Biology. John Wiley & Sons, Inc. (1994-98). 

For computational nucleic acid and protein sequence analysis Wisconsin 
Package Version 9.0 and 10.0, Genetics Computer Group (GCG) and Vector NT! 
Deluxe v4.0.3 software and database packages were used. Unless specified 
otherwise, all default paramenters were used. 

35 Materials and methods suitable for the maintenance and growth of 

bacterial cultures are well known in the art. Techniques suitable for use in the 
following examples may be found as set out in Manual of Methods for General 
Bacteriology (Phillipp Gerhardt, R. G. E, Murray, Ralph N. Costilow, Eugene 
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Nester, WUlis A. Wood, Noel Krieg and G. Briggs Phillips, eds.), American 
Society for Microbiology, Washington, DC. (1994)) or by Thomas D. Brock, in 
Biotechnology: A Te xtbook of Industrial Microbiology, Second Edition, Sinauer 
Associates, Inc., Sunderland, MA (1989). All reagents, restriction enzymes and 
5 materials used for the growth and maintenance of bacterial cells were obtained 
from Aldrich Chemicals (Milwaukee, WI), DIFCO Laboratories (Detroit, MI), 
GIBCO/BRL (Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO) 
unless otherwise specified 

PGR reactions were run on GeneAMP PGR System 9700 using Amplitaq 
10 or Amplitaq Gold enzymes (PE Applied Biosystems, Foster City, CA). The 
cycling conditions and reactions were standardized according to manufacture's 
instructions. 

The meaning of abbreviations is as follows: "sec" means second(s), "min" 
means niinute(s), "h" means hour(s), "d" means day(s), "^L" means microliter, 
1 5 "mL" means milliliters, "L" means liters, "mM" means millimolar, "M" means 
molar, "mmol" means millimole(s), "Cm^' means chloramphenicol resistance, 
"cat" means chloroamphenicol acetyl transferase, "Ampr*' means ampicillin 
resistance, "AmpS" means ampicillin sensitivity, "kb" means kilo base. 

EXAMPLE 1 

20 Targeted Gene Replacement by Linear DNA Fragments 

for Site-directed Mutagenesis of Escherichia coli 
Example 1 describes the cloning, mutation and integration of a mutant 
yciK gene into the Escherichia coli genome by homologous recombination using 
linearized mutant DNA. 
25 Isolation of vc/ATgene : 

A 1460 bp fragment of DNA (SEQ ID NOrl, Figure 1) containing yczJT 
gene was amplified using pCR-Script Amp Cloning kit from Stratagene (catalog 
#21 1 188) from genomic DNA of Escherichia coli strain W3 1 1 0 (Sigma- catalog 
#D0421). PCR amplification (MuUis and Faloona, Methods EnzymoL 
30 1 55:335-350 (1 987)) was performed using oligonucleotides SEQ ID N0:2 and 
SEQ ID NO:3 as forward- and reverse primers, respectively (Figure 1 ). The PCR 

nmHlirt ^X7«»o Kln-nt-OTirlo^ n^A lirm^nrl ^^i-^^ O*.^ x-^ /^T» n _ - ^ 

Amp SK(+) followed by transformation mto XL-1 Blue MRF' Kan 
supercompetent cells. Transformed cells were plated on Luria-Bertani medium 
35 containing ampicillin, X-gal and IPTG. Plasmids were isolated from v^te, 
ampicillin-resistant transfonnants and tested for the presence of the insert by 
restriction digestion. One such plasmid, pyciKCK, was subsequently confirmed 
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for the insert containing yciK gene by sequencing on ABI PRISM (Model 377, 
Version 2. LI). 

Cloning of vciK gene in a vector with suitable restriction sites : 

The pyciKCK DNA was \ised as a template with primers SEQ ID NO:4 
5 and SEQ ID N0:5 to generate a DNA fragment (SEQ ID NO:6) containing yciK 
gene flanked with HindlJl andJChal restriction sites (Figure 1) for cloning in 
Hin6miXhol restricted pBluescript 11 SK(-+-) vector (Stratagene; catalog #212205). 
The resultant plaraid was named pSCyciKl (Figure 2). 
In vitro disrup tion of yciK gene with insertional mutagenesis : 

^ 0 To disrupt yciK open reading frame (ORF), a HinclL fragment (Figwe 2) of 

the plasmid pSCyciKl which encodes amino acids 124 to 189 of j^ciJ^'ORF (total 
length=252 amino acids; SEQ ID NO:8) was replaced with a 1.2 kb HinClI-Smal 
fragment (bases 1-1225 of the SEQ ID NO:7 (see also Figure 3), constituting the 
sense strand of the selectable chloramphenicol-resistance encoding gene flanked 

15 by bacteriophage PI loxF sites (Snaith et al., Gene 166:173-174 (1995) in plasmid 
pLoxCatl). This construction yielded two plasmids pyciKcat2 (Figure 4) and 
pyciKcaO (Figure 5) containing Cm"* ORF (cat) m opposite or same orientation as 
yciK^ respectively. 

Introduction o f vc/iT mutation into Escherichia coli genome bv homologous 

20 recombina tion using linearized mutant DNA : 

The plasmid construct (pyciKcat2 or pyciKcaG) was digested with Aflm 
and Xmnl, followed by gel purification of a 3 .9 kb band of linearized DNA 
fragments (Figures 3 and 4) containing mutant ^c/A' gene. Escherichia coli strains 
FM5 (ATCC 5391 1) and KLP23 (FM5 glpK'gldA^ were electrotransformed with 

25 0.5 ^g of the above 3 .9 kb linear DNA and the resulting transfortnants were 
screened for double recombinant phenotype of chloramphenicol resistance 
(25 lag/mL) and ampicillin sensitivity (1 00 |xg/mL) on LB medium. High 
numbers of transformants (roughly lO^-lO^) were screened. Genomic DNA from 
Cm^Amps colonies was dot blotted on Hybond N"** positively charged nylon 

30 membrane (Amersham; catalog # RPN2020B), A 982 bp Jt&al fragment (bases 
584 039 of SEQ ID N0:7 denote the sense strand) that contains CmF encoding 
«^-*^uxwx*i^ inru£> uowu a piwut^. r\xL \^iii--rtxup'- uuiuuics ic2iLcu pusiuve xor tne 
presence of DNA encoding chloramphenicol resistance gene under high 
stringency conditions (0.1 x SSC, 0.1% SDS, 60^C for 15 min). Probe labeling, 

35 hybridization and detection for dot blots and subsequent Southern blotting 

experiments, were performed using ECL random prime labelling and detection 
systems version II, (Amersham International pic, Buckinghamshire, England). 
Genomic DNA from randomly selected CmrAmp'' colonies, that also tested 
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positive for the presence of the cat (encodes chloramphenicol acetyl transferase) 
gene, was used as template for PGR reactions using oligonucleotide pair (SEQ ID 
NO:9 and SEQ ID NO: 10; Figure 1). These primers were designed such that only 
genome-specific yciK (wild-type or mutant) amplification could occur. Plasmid- 
5 homt yciK (wild-type in pSCyciKl or mutant in pyciKcat2/pyciKcat3) did not 
contain complementary sequences to these primers and as a result could not yield 
any PGR product This design was for avoiding false positives. The DNA fi-om 
mutant colonies yielded a 3,1 kb FCR product compared to 2.1 kb product from 
wild-type (SEQ ID NO: 17; Figure 6) confirming ihziyciK gene in the 

10 Escherichia coli genome contained an insertion as predicted by the construct 

design. The mutants were further authenticated by Southern hybridization of Sstll 
digested genomic DNA fragments (Figure 1) probed with 969 bp SstU, fiagment of 
pyciKGK containing yciK DNA (SEQ ID NO: 1 8), All the mutants yielded a 
23 kb band compared to the L3 kb band (SEQ ID NO: 19) of wild-type strains 

15 (Figure?), 

This Example demonstrated that several independent ^ciiT knockouts were 
obtained using two different gene constructs and two different genetic 
backgrounds of E. coli hosts, FM204 was a FM5 derivative obtained by 
integrating linearized DNA from pyciKcat2; the FM5 integrants that resulted from 
20 the pyciKcatB AflUl-Xmnl fragment were named FM301 and FM302. Sunilariy, 
KLP201 and KLP202 were KLP23 derivatives using pyciKcat2 fragments. 
KLP301 and KLP304 were also derived from KLP23 by using the fragment fi-om 
pyciKcatB. 

EXAMPLE 2 

25 Targeted Replacement of Mutant Gene with 

a Functional Gene Using Linear DNA Fragments 
Linear DNA molecules can also be used to replace mutant genes with the 
wild-type gene. If a mutant has a phenotype of lack of growth imder certain 
nutrient or environmental conditions, a replacement of wild-type gene can be 
30 selected by acquired ability for growth under same conditions. For example, 
hemB mutants of Escherichia coli and Bradyrhizobium japonicum require 

{1 Bacterial 175:7222-7227(1993)), A Imear DNA fragment containing 
fimctional hemB gene from a homologous or heterologous source is amplified (by 
35 PGR or by cloning). The Escherichia coli hemB mutant strain RP523 (Li, J. M. 
et al., J. Bacterial 170:1021-1025 (1988) is electrotransformed with the ampUfied 
linear(ized) DNA fragments that harbor fimctional hemB gene. Transformants arc 
screened on LB plates for growth. Any colonies that appear are due to a 
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functional hemB gene in the genome. If insertional mutations are being replaced, 
genomic constmcts are further tested by PGR amplification or Southern 
hybridizations of the region of DNA modified. If the insertion also contains a 
selectable marker, the transformauts may be screened for the absence of the 
5 phenotypc conferred by the selectable marker. In case a point mutation is to be 
replaced, the PGR amplified fragment is sequenced either directly or after cloning 
in a suitable vector (e.g. pBluescript II SK(-l-), Stratagene, Cat #212205) for the 
presence of wild-type sequence. This method is extremely beneficial for restoring 
specific gene functionality in a host strain that has gone through several sequential 
1 0 mutations in the genome, 

EXAMPLES 
One-step PGR Method to Generate Linear DNA 
Fragments for Site-directed Mutagenesis 
Oligonucleotide pairs approximately 150 bases in length are synthesized. 
15 Up to 30 bp on the 3 -end of forward primer is identical to a region flanking the 
5'-end of a selectable marker and the rest of the bases are identical to the 5' region 
of the gene to be disrupted. Up to 30 bp on the 3*-end of reverse primer are 
complementary to a region flanking the 3*-end of a selectable marker and the rest 
of the bases are identical to the 3' region of the gene to be dismpted. The above 
20 oligonucleotides are used in a PGR reaction to amplify DNA fragments that 
contain a marker gene flanked on each side with at least 20 bases of sequence 
homologous to the gene desired to be mutated. The PGR product is used to 
electrotransform wild-type Escherichia coli cells for generating mutants by 
homologous recombination which can be tested by methods described in 
25 Example 1. 

EXAMPLE 4 

Engineering of Triosephosphate Isomerase Mutant of Escherichia coli KLP23 
This example describes the construction of plasmid for triosephosphate 
isomerase gene replacement in Escherichia coli KLP23 and engineering of 
30 triosephosphate isomerase mutant RJ8m by linear DNA transformation. 

Construction of plasmid for triosephosphate isomerase gene replacement in 
Escherickia coli KLP23 ; 

Escherichia coli KLP23 genomic DNA was prepared using the Puregene 
DNA Isolation Kit (Centra Systems, Minneapolis, MN)* A LO kb DNA fragment 
35 containing cdh and the 3' end of triosephosphate isomerase (IpiA) genes was 

amplified by PGR (Mullis and Faloona, Methods Enzymol. 155:335-350 (1987)) 
from KLP23 genomic DNA using primers (SEQ ID NO: 1 1 and SEQ ID NO: 12). 
A LO kb DNA fragment containing the 5* end of tpiA.yiiQ, and the 5' end oiyiiR 
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genes was amplified by PGR from KLP23 genomic DNA using primers SEQ ID 
N0:13 and SEQ ID NO:14. The first 26 bases at 5* end of piimer SEQ ID NO:13 
are complementary to the primer SEQ ID NO: 12 to enable subsequent overlap 
extension PGR, followed by a Seal (AGTACT) site. The next 28 bases following 
5 Seal site at the 3' end of SEQ ID: 13 reside on 5* end of the tpiA gene (Figure 8). 
The gene splicing by overlap extension technique (Horton et al., BioTechniques 
8:528-535 (1990)) was used to generate a 2.0 kb fragment by PGR using the 
above two PGR fragments as templates and primers SEQ ID NO: 1 1 and SEQ ID 
NO: 1 4, This fragment represented a deletion of 73% of the 768 bp tpiA ORF. 

1 0 Overall, this fragment had 1 .0 kb flanking regions on either side of the .Seal 

cloning site (within the partial tpiA) to allow for chromosomal gene replacement 
by homologous recombination. 

The above blunt-ended 2.0 kb PGR fragment was cloned into the 
pCR-Blunt vector using the Zero Blunt PGR Gloning Kit (Invitrogen, San Diego, 

1 5 G A) to yield the 5.5 kb plasmid pRNl 06-2 (Figure 9) containing kanamycin and 
Zeocin resistance genes* The 1 .2 kb Hindi fragment (SEQ ID NO:7, Figure 3) 
containing a chloramphenicol-resistance gene flanked by bacteriophage PI loxP 
sites (Snaith et al.. Gene 166:173-174 (1995)), was used to intermpt the tpiA 
fragment in plasmid pKN106-2 by ligating it to iScal-digested plasmid pRN106-2 

20 to yield the 6,8 kb plasmid pRN107-l (Figure 10), 

Enrineering of triosenhosphate isomerase mutant (RJ8m) bv linear DNA 
transformation : 

Usmg pRN107-l as template and primers SEQ ID NO:l 1 and SEQ ID 
NO: 14, the 3*2 kb fragment containing tpiA flankmg regions and chloramphenical 

25 resistance marker (SEQ ID N0:7) was PGR-amplified and gel-extracted. 

Escherichia coU KLP23 was electrotransfonned with about 1 of this 3.2 kb 
linear DNA fragment Transformants that were chloramphenicol-resistant 
(12*5 Hg/mL) and kanamycin-sensitive (30 ^tg/mL) were further screened on M9 
minim a l media for poor glucose utilization on 1 mM glucose, a phenotype 

30 expected for tpiA mutant. An EcoBl digest of genomic DNA from one such 
mutant, RJSm, was probed with the intact tpiA gene via Southern analysis 
(Southern, 1 MoL Biol 98:503-517 (1975)). The results indicated that it was a 
double-crossover integrant {tpiA gene replacement) as the two expected 6.6 kb and 
3.0 kb bands were observed, owing to the presence of an additional EcoKl site 

35 within the chloramphenicol resistance gene. As expected, the host KLP23 and 
wild-type FM5 controls yielded single 8.9 kb and 9.4 kb bands respectively. This 
tpiA mutant was fiirther analyzed by genomic PGR using primers SEQ ID NO: 15 
and SEQ ID NO: 16 (Figure 1 1), which yielded the expected 4,6 kb PGR fragment 
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while for the same primer pair the host KLP23 and wild-type FM5 strains both 
yielded the expected 3.9 kb PGR ftagment. When cell-free extracts from tpiA 
mutant RJ8m and host KLP23 were tested for tpiA activity using glyceraldehyde 
3-phosphate as substrate, no activity was observed with RJ8m. Taken together, a 
5 confirmation of tpiA knockout was demonstrated. 
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CLAIMS 

1 . A method for gene replacement in recombination-proficient bacteria 
comprising: 

(a) transforming recombination-proficient bacteria with a 

5 replacement cassette containing 5' and 3' flanking sequences, each 

flanking sequence independently with at least 75% identity to the 
target DNA; and 

(b) screening for an altered phenotype. 

2. The method of Claim 1 in which absolute recombination frequencies 
10 are at least 10"^ to enable selection of double homologous recombination event(s). 

3. The method of Claim 1 in which the transforming step is performed 
chemically or by electroporation. 

4. The method of Claim 1 in which the recombination-proficient bacteria 
are enteric bacteria. 

15 5. A method of gene disruptions in recombination-proficient enteric 

bacteria comprising transforamig the recombination-proficient enteric bacteria 
with a replacement cassette containing 5' and 3' flanking sequences, each flanking 
sequence independently with at least 75% identity to the target DNA. 

6. A method of replacing mutant genes with functional genes in 

20 recombination-proficient enteric bacteria comprising transforming recombination- 
proficient bacteria with a replacement cassette containing 5* and 3' flanking 
sequences, each flanking sequence independently with at least 75% identity to the 
target DNA. 

7. A method of regxilating complex operons comprising deleting or 
25 replacing one or more promoter regions operably linked to the complex operon 

with at least one alternate desired promoter region. 

8. A method according to any one of Claims 1, 2, 3, 4, or 5 for gene 
replacement in recombination-proficient bacteria further comprising: prior to 
transforming the recombination-proficient the replacement cassette is genemted 

30 using direct PCR of a selection marker wherein the direct PCR uses long primer 
pairs, each primer having at least 75% identity to the target DNA at the 5' end and 
at least 75% identity to the selection marker at the 3' end. 
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SEQUENCE LISTING 



<110> E. I. du Pont de Nemours and Company 

<120> Targeted Gene Replacements in Enteric Bacteria Using 
Linear DNA 

<130> BClOlO POT 

<140> 
<141> 

<150> 60/141,103 

<151> 1999-06-25 

<160> 19 

<170> Microsoft Office 97 

<210> 1 
<211> 1460 
<212> DNA 

<213> Escherichia coli 
<400> 1 

cataaagccc caaagtcata aagtacactg gcagcgcgac agacaacgtc gttgaggagt 60 
agccatgcat taccagccaa aacaagattt actcaatgat cgcattatcc tggtgacggg 120 
agccagcgat ggtattggtc gtgaagccgc gatgacgtat gcacgctatg gtgcgacagt 180 
gattctgttg ggccgtaatg aagaaaaatt acgtcaggta gccagccaca taaacgaaga 24 0 
aactgggcgt cagccacagt ggtttattct cgatttgctg acctgcacgt ccgaaaattzg 300 
ccaacaactg gcacagcgca ttgccgttaa ttatccgcgt ctggatggtg ttttgcataa 360 
tgccggattg ctcggcgatg tttgcccaat gagcgaacaa aatccgcagg tctggcagga 420 
cgtcatgcag gtcaacgtta atgccacctt tatgctcacc caggcactgc ttcctttatt 480 
actcaaatcg gacgccggtt cactggtctt tacttcatca agcgttggac gtcagggacg 540 
agccaactgg ggtgcatatg cagcgtcgaa atttgccacc gaagggatga tgcaggtact 600 
ggccgatgaa tatcagcagc gcctgcgtgt caactgcatt aacccaggcg gtacgcgcac 660 
cgcaatgcgt gccagcgcct tcccgaccga agatccacag aaacttaaaa cacccgctga 720 
tatcatgccg ctctacctct ggctgatggg cgatgacagc cgccgtaaaa ccggcatgac 780 
ctttgacgcc caaccgggcc gtaaaccagg aatttcccaa tgagtgatga acgctaccaa 84 0 
cagcgtcagc agcgagtgaa agaaaaagta gatgctcgtg tggcccaggc ccaggatgaa 900 
cgcggtatta tcatcgtctt taccggcaat ggaaaaggca aaaccaccgc ggcatttggt 960 
acggcaacac gcgcagttgg tcacggaaaa aaagtaggcg tcgtgcagtt tattaaaggc 1020 
acctggccta atggcgaacg caatctgctg gagccacatg gcgttgagtt tcaggtgatg 1080 
gcaacgggct ttacctggga tacacaaaac cgcgagtctg ataccgccgc ctgccgcgaa 1140 
gtctggcaac atgcaaagcg gatgcttgct gattcctcac tggatatggt tttgcttgat 1200 
gaactgacgt atatggtggc gtatgactat ttgccactgg aagaagtggt gcaggcgtta 1260 
aatgaacgtc cacatcaaca gacggtgatt atcacgggtc gtggttgtca tcgggatatt 1320 
cttgaactgg cagatacggt aagtgaatta cgccccgtca aacatgcgtt tgatgccggt 1380 
gtaaaagcgc agatagggat cgattattaa tctgtaaaaa tgccggatat gacgtccggc 1440 
atactttgat taaccgttat 1460 

<210> 2 

<211> 30 

<212> DNA 

<213> Primer 



<400> 2 

cataaagccc caaagtcata 



aagtacactg 



30 



<210> 
<211> 
<212> 
<213> 



3 

29 

DNA 

Primer 
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<400> 3 

ataacggtta atcaaagtat gccggagtc 

<210> 4 

<211> 33 

<212> DNA 

<213> primer 



29 



<400> 4 

aggaagcttg ccccaaagtc ataaagtaca ctg 33 

<210> 5 

<211> 33 

<212> DNA 

<213> priiaer 

<400> 5 

atactcgagg ttaatcaaag tatgccggac gtc 33 

<210> 6 

<211> 1466 

<212> DNA 

<213> Escherichia coli 



<400> 6 

aggaagcttg 

agtagccatg 

gggagccagc 

agtgattctg 

agaaactggg 

ttgccaacaa 

taatgccgga 

ggacgtcatg 

attactcaaa 

acgagccaac 

actggccgat 

caccgcaatg 

tgatatcatg 

gacctttgac 

caacagcgtc 

gaacgcggta 

ggtacggcaa 

ggcacctggc 

atggcaacgg 

gaagtctggc 

gatgaactga 

ttaaatgaac 

attcttgaac 

ggtgtaaaag 

ggcatacttt 



ccccaaagtc 
cattaccagc 
gatggtattg 
ttgggccgta 
cgtcagccac 
ctggcacagc 
ttgctcggcg 
caggtcaacg 
tcggacgccg 
tggggtgcat 
gaatatcagc 
cgtgccagcg 
ccgctctacc 
gcccaaccgg 
agcagcgagt 
ttatcatcgt 
cacgcgcagt 
ctaatggcga 
gctttacctg 
aacatgcaaa 
cgtatatggt 
gtccacatca 
tggcagatac 
cgcagatagg 
gattaacctc 



ataaagtaca 
caaaacaaga 
gtcgtgaagc 
atgaagaaaa 
agtggtttat 
gcattgccgt 
atgtttgccc 
ttaatgccac 
gttcactggt 
atgcagcgtc 
agcgcctgcg 
ccttcccgac 
tctggctgat 
gccgtaaacc 
gaaagaaaaa 
ctttaccggc 
tggtcacgga 
acgcaatctg 
ggatacacaa 
gcggatgctt 
ggcgtatgac 
acagacggtg 
ggtaagtgaa 
gatcgattat 
gagtat 



ctggcagcgc 
tttactcaat. 
cgcgatgacg 
attacgtcag 
tctcgatttg 
taattatccg 
aatgagcgaa 
ctttatgctc 
ctttacttca 
gaaatttgcc 
tgtcaactgc 
cgaagatcca 
gggcgatgac 
aggaatttcc 
gtagatgctc 
aatggaaaag 
aaaaaagtag 
ct ggagccac 
aaccgcgagt 
gctgattcct 
tatttgccac 
attatcacgg 
ttacgccccg 
taatctgtaa 



gacagacaac 
gatcgcatta 
tatgcacgct 
gtagccagcc 
ctgacctgca 
cgt ctggatg 
caaaatccgc 
acccaggcac 
tcaagcgttg 
accgaaggga 
attaacccag 
cagaaactta 
agccgccgta 
caatgagtga 
gtgtggccca 
gcaaaaccac 
gcgtcgtgca 
atggcgttga 
ctgataccgc 
cactggatat 
tggaagaagt 
gtcgtggttg 
tcaaacatgc 
aaatgccgga 



gtcgttgagg 
tcctggtgac 
atggtgcgac 
acataaacga 
cgtccgaaaa 
gtgttttgca 
aggtctggca 
tgcttccttt 
gacgtcaggg 
tgatgcaggt 
gcggtacgcg 
aaacacccgc 
aaaccggcat 
tgaacgctac 
ggcccaggat 
cgcggcattt 
gtttattaaa 
gtttcaggtg 
cgcctgccgc 
ggttttgctt 
ggtgcaggcg 
tcatcgggat 
gtttgatgcc 
tatgacgtcc 



60 
120 
180 
240 
30O 
360 
420 
480 
540 
60O 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1466 



<210> 7 

<211> 1247 

<212> DNA 

<213> Plasmid 



<400> 7 

gacggtatcg ataagctgga tccataactt 

gagtccgaat aaatacctgt gacggaagat 

cctgttgata ccgggaagcc ctgggccaac 

gtaagaggtt ccaactttca ccataatgaa 

ttatcgagat tttcaggagc taaggaagct 

accgttgata tatcccaatg gcatcgtaaa 

caatgtacct ataaccagac cgttcagctg 



cgtataatgt atgctatacg aagttatcta 60 

cacttcgcag aataaataaa tcctggtgtc 120 

ttttggcgaa aatgagacgt tgatcggcac 180 

ataagatcac taccgggcgt attttttgag 240 

aaaatggaga aaaaaatcac tggatatacc 300 

gaacattttg aggcatttca gtcagttgct 360 

gatattacgg ccttrttaaa gaccgtaaag 420 
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aaaaataagc acaagtttta tccggccttt attcacattc trtgcccgcct gatgaatgct 480 

catccggaat tccgtatggc aatgaaagac ggtgagctgg tgatatggga tagtgttcac 540 

ccttgttaca ccgttttcca tgagcaaact gaaacgtttt catcgctctg gagtgaatac 600 

cacgacgatt tccggcagtt tctacacata tattcgcaag atgtggcgtg ttacggtgaa 660 

aacctggcct atttccctaa agggtttatt gagaatatgt ttttcgtctc agccaatccc 720 

tgggtgagtt tcaccagttt tgatttaaac gtggccaata tggacaactt cttcgccccc 780 

gttttcaccc atgggcaaat attatacgca aggcgacaag gtgctgatgc cgctggcgat 840 

tcaggttcat catgccgtct gtgatggctt ccatgtcggc agaatgctta atgaattaca 900 

acagtactgc gatgagtggc agggcggggc gtaatttttt taaggcagtt attggtgccc 960 

ttaaacgcct ggtgctacgc ctgaataagt gataataagc ggatgaatgg cagaaattcg 1020 

gacctgcagg catgcaactc tagataactti cgtataatgt atgctatacg aagttatgcg 108 0 

gccgccatat gcatcctagg cctattaata ttccggagta tacgtagccg gctaacgttc 1140 

tagcatgcga aatttaaagc gctgatatcg atcgcgcgca gatctgtcat gatgatcatt 1200 

gcaattggat ccatatatag ggcccggggt. tataattacc tcaggtc 1247 

<210> 8 
<211> 252 
<212> PRT 

<213> Escherichia coli 
<400> 8 

Met His Tyr Gin Pro Lys Gin Asp Leu Leu Asn Asp Arg lie lie Leu 
13 10 15 

Val Thr Gly Ala Ser Asp Gly He Gly Arg Glu Ala Ala Met Thr Tyr 
20 25 30 

Ala Arg Tyr Gly Ala Thr Val He Leu Leu Gly Arg Asn Gla Glu Lys 
35 40 45 

Leu Arg Gin Val Ala Ser His He Asn Glu Glu Thr Gly Arg Gin Pro 
50 55 60 

Gin Trp Phe He Leu Asp Leu Leu Thr Cys Thr Ser Glu Asn Cys Gin 
65 70 75 80 

Gin Leu Ala Gin Arg He Ala Val Asn Tyr Pro Arg Leu Asp Gly Val 
85 90 95 

Leu His Asn Ala Gly Leu Leu Gly Asp Val Cys Pro Met Ser Glu Gin 
100 105 110 

Asn Pro Gin Val Trp Gin Asp Val Met Gin Val Asn Val Asn Ala Thr 
115 120 125 

Phe Met Leu Thr Gin Ala Leu Leu Pro Leu Leu Leu Lys Ser Asp Ala 
130 135 140 

Gly Ser Leu Val Phe Thr Ser Ser Ser Val Gly Arg Gin Gly Arg Ala 
145 150 155 160 

Asn Trp Gly Ala Tyr Ala Ala Ser Lys Phe Ala Thr Glu Gly Met Met 
165 170 175 

Gin Val Leu Ala Asp Glu Tyr Gin Gin Arg Leu Arg Val Asn Cys He 
180 185 190 

Asn Pro Gly Gly Thr Arg Thr Ala Met Arg Ala Ser Ala Phe Pro Thr 
195 200 205 

Glu Asp Pro Gla Lys Leu Lys Thr Pro Ala Asp He Met Pro Leu Tyr 
210 215 220 
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Leu Trp Leu Met Gly Asp Asp Ser Arg Arg Lys Thr Gly Met Thr Phe 
22S 230 235 240 

Asp Ala Gin Pro Gly Arg Lys Pro Gly He Ser Gin 
245 250 

<2X0> 9 
<211> 22 
<212> DNA 
<213> primer 

<400> 9 

cagacaacaa ttccacgagt cc 22 

<210> 10 

<211> 22 

<212> DNA 

<213> primer 

<400> 10 

tgaagttgaa cgtgaatatg cc 22 

<210> 11 

<211> 28 

<212> DNA 

<213> primer 

<400> 11 

tcaaacccgg tggtttctcg cgaccggg 28 

<210> 12 

<211> 28 

<212> DNA 

<213> primer 

<400> 12 

ctcagccgga tatcgacggc gcgctggt 28 

<210> 13 

<211> 60 

<212> DNA 

<213> primer 

<400> 13 

accagcgcgc cgtcgatatc cggctgagta ctcaacacct gccagctctt tacgcaggtt 60 

<210> 14 

<211> 28 

<212> DNA 

<213> Primer 

<400> 14 

cagcatgcct gcgaaccaca ggcctatc 28 

<210> 15 

<211> 28 

<212> DNA 

<213> Primer 

<40O> 15 

atgaacaagt ggggcgtagg gttaacat 28 

<210> 16 
<211> 28 
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<212> DNA 
<213> Primer 

<400> 16 

ttaattactt gatttattgt cggcttta 28 

<210> 17 
<211> 2095 
<212> DNA 

<213> Escherichia coli 
<400> 17 

cagacaacaa ttccacgagt ccccaccttg gttaaacaac gctttggctc acagtttaca 60 
acaggcatgt taagccgtct cgcagagatg ttattgcagc gaaagtatga caaagtatcc 120 
attttgcggt gttatcgccc caggctgatt gaaatgccct tctgtttcag gcataaagcc 180 
ccaaagtcat aaagtacact ggcagcgcga cagacaacgt cgttgaggag tagccatgca 240 
ttaccagcca aaacaagatt tactcaatga tcgcattatc ctggtgacgg gagccagcga 300 
tggtattggt cgtgaagccg cgatgacgta tgcacgctat ggtgcgacag tgattctgtt 360 
gggccgtaat gaagaaaaat tacgtcaggt agccagccac ataaacgaag aaactgggcg 420 
tcagccacag tggtttattc tcgatttgct gacctgcacg tccgaaaatt gccaacaact 480 
ggcacagcgc attgccgtta attatccgcg tctggatggt gttttgcata atgccggatt 54 0 
gctcggcgat gtttgcccaa tgagcgaaca aaatccgcag gtctggcagg acgtcatgca 600 
ggtcaacgtt aatgccacct ttatgctcac ccaggcactg cttcctttat tactcaaatc 660 
ggacgccggt tcactggtct ttacttcatc aagcgttgga cgtcagggac gagccaactg 720 
gggtgcatat gcagcgtcga aatttigccac cgaagggatg atgcaggtac tggccgatga 780 
atatcagcag cgcctgcgtg tcaactgcat taacccaggc ggtacgcgca ccgcaatgcg 840 
tgccagcgcc ttcccgaccg aagatccaca gaaacttaaa acacccgctg atatcatgcc 900 
gctctacctc tggctgatgg gcgatgacag ccgccgtaaa accggcatga cctttgacgc 960 
ccaaccgggc cgtaaaccag gaatttccca atgagtgatg aacgctacca acagcgtcag 1020 
cagcgagtga aagaaaaagt agatgctcgt gtggcccagg cccaggatga acgcggtatt 1080 
atcatcgtct ttaccggcaa tggaaaaggc aaaaccaccg cggcatttgg tacggcaaca 1140 
cgcgcagttg gtcacggaaa aaaagtaggc gtcgtgcagt ttatitaaagg cacctggcct 1200 
aatggcgaac gcaatctgct ggagccacat ggcgttgagt ttcaggtgat ggcaacgggc 1260 
tttacctggg atacacaaaa ccgcgagtct gataccgccg cctgccgcga agtctggcaa 1320 
catgcaaagc ggatgcttgc tgattcctca ctggatatgg ttttgcttga tgaactgacg 1380 
tatatggtgg cgtatgacta tttgccactg gaagaagtgg tgcaggcgtt aaatgaacgt X440 
ccacatcaac agacggtgat tatcacgggt cgtggttgtc atcgggatat tcttgaactg 1500 
gcagatacgg taagtgaatt acgccccgtc aaacatgcgt ttgatgccgg tgtaaaagcg 1560 
cagataggga tcgattatta atctgtaaaa atgccggata tgacgtccgg catactttga 1620 
ttaaccgtta ttattacgtc cgccagaacg acgaccgccg ctcacctgac tgtgacgttt 1680 
caccgcacga cgaatctgat tcgccttcat gcgacgacgg tctttttcta cagcgacttt 1740 
agagctggtt tctggcggta gctccaccag ttcgcgcaga tagttagtct gggcgaggtc 1800 
cagctccgtc cagccaccgc gtggcagacc ttttggcagt gggatatcac cgtaacgaac 1860 
acggatcagg cggctcacct gcacaccaac cgcttcccac agacgacgaa cttcacggtt 1920 
acggccttcg gtcagagtca cgttgtacca ctggttgatc ccttcgccgc cgctgaactt 1980 
gatggtttta aaagctgccg gaccatcttc caactgcacg ccacgactca aatcacgcag 2040 
tttcgcgtcg tcaacctgac caaatacacg cacggcatat tcacgttcaa cttca 2095 

<210> 18 

<211> 969 

<212> DNA 

<213> Escherichia coli 

<400> 18 



ggtggcggcc gctctagccc ataaagcccc 
gacaacgtcg ttgaggagta gccatgcatt 
gcattatcct ggtgacggga gccagcgatg 
cacgctatgg tgcgacagtg attctgttgg 
ccagccacat aaacgaagaa actgggcgtc 
cctgcacgtc cgaaaattgc caacaactgg 
tggatggtgt tttgcataat gccggattgc 
atccgcaggt ctggcaggac gtcatgcagg 
aggcactgct tcctttatta ctcaaatcgg 
gcgttggacg tcagggacga gccaactggg 



aaagtcataa agtacactgg cagcgcgaca 60 
accagccaaa acaagattta ctcaatgatc 120 
gtattggt-cg tgaagccgcg atgacgtatg 180 
gccgtaatga agaaaaatta cgtcaggtag 240 
agccacagtg gtttattctc gatttgctga 300 
cacagcgcat tgccgttaat tatccgcgtc 360 
tcggcgatgt ttgcccaatg agcgaacaaa 420 
tcaacgttaa tgccaccttt atgctcaccc 480 
acgccggttc actggtcttt acttcatcaa 540 
gtgcatatgc agcgtcgaaa tttgccaccg 600 
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aagggatgat gcaggtactg gccgatgaat atcagcagcg cctgcgtgtc aactgcatta 
acccaggcgg tacgcgcacc gcaatgcgtg ccagcgcctt cccgaccgaa gatccacaga 
aacttaaaac acccgctgat atcatgccgc tctacctctg gctgatgggc gatgacagcc 
gccgtaaaac cggcatgacc tttgacgccc aaccgggccg taaaccagga atttcccaat 
gagtgatgaa cgctaccaac agcgtcagca gcgagtgaaa gaaaaagtag atgctcgtgt 
ggcccaggcc caggatgaac gcggtattat catcgtcttt accggcaatg gaaaaggcaa 
aaccaccgc 

<210> 19 
<211> 1282 
<212> DNA 

<213> Escherichia coli 
<400> 19 

ggccagttct tctttcatct ccttatactg ttcgctgaga ttgttgaccc gtaactcgcc 60 
acgctggcgt ttattacgtt gagcaacatt gacaataatg gcggcaatcg ccgcaatcgc 120 
tagcacaacg gtaacgattt tcgccaaaaa caaaccatat tcagacaaca attccacgag 180 
tccccacctt ggttaaacaa cgctttggct cacagtttac aacaggcatg ttaagccgtc 240 
tcgcagagat gttattgcag cgaaagtatg acaaagtatc cattttgcgg tgttatcgcc 300 
ccaggctgat tgaaatgccc ttctgtttca ggcataaagc cccaaagtca taaagtacac 360 
tggcagcgcg acagacaacg tcgttgagga gtagccatgc attaccagcc aaaacaagat 420 
ttactcaatg atcgcattat cctggtgacg ggagccagcg atggtattgg tcgtgaagcc 480 
gcgatgacgt atgcacgcta tggtgcgaca gtgattctgt tgggccgtaa tgaagaaaaa 540 
ttacgtcagg tagccagcca cataaacgaa gaaactgggc gtcagccaca gtggtttatt 60O 
ctcgatttgc tgacctgcac gtccgaaaat tgccaacaac tggcacagcg cattgccgtt 660 
aattatccgc gtctggatgg tgttttgcat aatgccggat tgctcggcga tgtttgccca 720 
atgagcgaac aaaatccgca ggtctggcag gacgtcatgc aggtcaacgt taatgccacc 780 
tttatgctca cccaggcact gcttccttta ttactcaaat cggacgccgg ttcactggtc 840 
tttacttcat caagcgttgg acgtcaggga cgagccaact ggggtgcata tgcagcgtcg 900 
aaatttgcca ccgaagggat gatgcaggta ctggccgatg aatatcagca gcgcctgcgt 960 
gtcaactgca ttaacccagg cggtacgcgc accgcaatgc gtgccagcgc cttcccgacc 1020 
gaagatccac agaaacttaa aacacccgct gatatcatgc cgctctacct ctggctgatg 1080 
ggcgatgaca gccgccgtaa aaccggcatg acctttgacg cccaaccggg ccgtaaacca 1140 
ggaattrtccc aatgagtgat gaacgctacc aacagcgtca gcagcgagtg aaagaaaaag 1200 
tagatgctcg tgtggcccag gcccaggatg aacgcggtat tatcatcgtc tttaccggca 1260 
atggaaaagg caaaaccacc gc 1282 
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